Implementing a Quantum Algorithm with Exchange- Coupled Quantum Dots: a 

Feasibility study. 
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We present Monte Carlo wavefunction simulations for quantum computations employing an 
exchange-coupled array of quantum dots. Employing a combination of experimentally and theo- 
retically available parameters, we find that gate fidelities greater than 98 % may be obtained with 
current experimental and technological capabilities. Application to an encoded 3 qubit (nine phys- 
ical qubits) Deutsch-Josza computation indicates that the algorithmic fidelity is more a question of 
the total time to implement the gates than of the physical complexity of those gates. 

o 
o 

^ . I. INTRODUCTION 

o 

D ' Quantum information processing has been characterized by a rapid pace of theoretical progress and slower devel- 
opment of experimental techniques. At the current time, the differential is clearly visible: while most experimental 
realizations to date are limited to a handful of qubits or less, much theoretical effort is devoted to elucidating the 
potential of quantum computers with thousands or even millions of qubits. As experiments progress there is a need 
to evaluate the many suggested experimental implementations, to determine if they are feasible as proposed. The 
PsJ ' evaluations need not be elaborate, just the calculation of some figures of merit that indicate the technology's compu- 
^ , tational merit. As an example of such a calculation, we present here simulation results and fidelities for the CNOT 
gate and for the three qubit Deutsch-Josza algorithm implemented in a model system of a spin-coupled quantum dot 
] array with exchange-only quantum computation. CliSi^) 

The possibility of employing coupled quantum wells for quantum information processing was first proposed by 
Landauer, and Barenco et al. in the mid-1990's.'i^^ Numerous researchers have since elaborated upon these ideas. 
We investigate here the quantum dot implementation proposed by Loss and DiVincenzo.'^-* As suggested by these 
authors, we consider a linear quantum dot array where each dot arises as a localized region within a two dimensional 
^ , electron gas, with the localization imposed by electrical gating. Each qubit is realized as the spin of an unpaired 
^ • electron on the quantum dot. Following Ref. '7", we shall assume the spin-orbit interaction is negligible, and the effect 
I [ of surrounding nuclear spins will be incorporated into the error terms. Thus, the spin of each electron constitutes a 
^ . well-defined two dimensional Hilbert space. Employing spin rather than orbital degrees of freedom greatly reduces the 
I effects of decoherence, since the spin states couple much less strongly to the environment than the charge states. 'i^^ 
^ : Whereas the charge degrees of freedom are characterized by a decoherence rate on the order of nanoseconds'iS^ , the 
spin degrees are relatively resistant to errors, and dephasing rates in the microsecond range can be expected. ^'^Q-^ It 
^ , should be noted that these figures stem from experiments done under non-decoherence suppressing conditions, and 
one might expect that with more elaborate experimental set-ups, e.g. taking advantage of spin polarization and 
spin echo techniques, the effects of inhomogeneous broadening as well as the hyperfine coupling between the electron 
and surrounding nuclear spins can be reduced. To date, there has been no actual experimental realization of the 
" " ' spin-coupled quantum dot array. However, demonstrated experimental ability to control the coupling between the 
dots'iiiii^-' and observations of coherent and long-lived spin oscillations^^ indicates that a quantum computer as 
envisioned by Loss and DiVincenzo could be realizable. 

Assuming that tunable gates, well-defined arrays, and reasonable control of decoherence processes have been 
achieved, the issue of measurement remains. Less work has been done on the realization of quantum measurement in 
these systems, although efforts towards experimental realization of single spin measurements in the solid state have 
been made.'ii^ii^*i^'' Since at the outset of these simulations no experimentally demonstrated measurement scheme 
existed, we have made the simplest assumption of noiseless projective measurements. 

In this work we employ the isotropic exchange interaction for coupling quantum spins with the exchange-only 
quantum computation scheme of Refs.^|3) and0- Use of an isotropic exchange interaction amounts to an idealization 
of the system as it generally exists in an experimental setting. For real quantum dots, some exchange anisotropy may 
arise from the effects of the finite spin-orbit coupling, "i^^ This anisotropy may be dealt with by modified encoding'iSiii^ 
or by pulse-shaping techniques. '■iSr^ Both of these approaches can in principle be applied to the simulation analysis 
presented here. 



2 



II. UNIVERSALITY THROUGH EXCHANGE 

A criteria for any quantum processing device is that two qubits can be coupled in a non-trivial way. In the quantum 
dot arrays under consideration here, this coupling arises from the nearest neighbour spin-spin interactions described 
by the Heisenberg exchange Hamiltonian:'i&^ 

i7y = J(t)S, -S,, (1) 

where Si = 1/2 {(Tix,(Jiy,(Tiz) are the spin operators at site i, with the Pauli matrices, and J{t) is the exchange 
coupling strength. Loss and DiVincenzo have shown that when the exchange interaction is pulsable (i.e., it can be 
switched between finite "on" and low or zero "off" values) this exchange interaction allows the implementation of 
V SWAP^ which, in conjunction with local unitaries, is equivalent to XOR (also known as CNOT).'i^-' The switching is 
controlled by the integrated coupling strength J J{t)dt — Jotg over the pulse duration tg, with the SWAP gate being 
achieved for Jpts = 7r(moc?27r) and V SWAP for one half of this. To facilitate calculations with this spin Hamiltonian 
it is convenient to rescale it by addition of a unit operator, to arrive at the exchange operator Eij 

Eij — 2{Si ■ Sj + -I4) 

= ^{cr^<E)(TJ+I,(E)Ij), (2) 

where I4 is the 4-by-4 identity matrix for the two-qubit Hilbert space. This rescaled operator acts to exchange the 
states of physical qubits i and j. In the rest of this paper all exchange gate times pertaining to single or two qubit 
gates will be given based on this exchange operator, unless otherwise indicated. As such they will be expressed in 
units of 2h/ Jq. 

Unfortunately, in the solid state one-qubit gates are often technically more demanding than two-qubit gates. Ci) Thus 
even the simple one-qubit gates required to obtain the XOR from SW AP require a high degree of experimental 
and technical sophistication. Kempe et al. showed that the Heisenberg interaction can be made universal by itself, 
with the use of a suitable encoding ("encoded universality" For a linear quantum dot array as that considered 
here, one possible encoding is to represent each logical qubit using three physical qubits: 

\0)l = -^(1011) -1101)), 

|1)l = y||110)--^ (1011) + 1101)). (3) 

Note that total spin along the z-axis, 5**°* = J2i ^i-zj is conserved and equal to i, and that the states, in addition, are 
eigenstates of S^. The computational basis is |00)l, |01)l, |10)li ^^nd |11)l. With this encoding, an S'C/(4) operation 
that is locally equivalent to CNOT between logical qubits, has been shown to be feasible with 19 exchanges'^ between 
adjacent pairs of physical qubits. This operation is illustrated in Fig. ^ 

It should be noted that 19 is an upper limit on the number of serial exchanges required, obtained through numerical 
optimization of the Makhlin invariants'i^-' , and it is possible that a smaller number might suffice. The number of 
exchange operations can also be reduced if one considers a more complex quantum dot architecture that supports non- 
nearest neighbour interactions, rather than the linear array studied here. In particular, if in addition to exchanges 
Hii+i, any Hij connection is accessible, then full parallelism is possible. With parallel exchanges of this type a 
one-qubit rotation can be shown to require only 3 exchanges, and a CNOT gate only 8 exchanges. 'ii^ 

Universal quantum information processing will require that we can implement the CNOT gate between any two 
logical qubits, and not just between adjacent qubits. This need is apparent even for such simple algorithms as the 
three qubit Deutsch-Josza algorithm that tests whether an arbitrary function is balanced or const ant. '^^'^^ We shall 
see below that one of the black box gate sequences describing a balanced function for the three qubit Deutsch-Josza 
algorithm requires a CNOT gate between qubits 1 and 3. We shall refer to this as CN0T(1,3). Analysis of truth 
tables for combinations of CNOTs between neighbouring dots readily shows that a CN0T(1,3) gate is equivalent to 
a pairwise sequence of four CNOT gates between adjacent dots, as illustrated in Fig. [5] 

It would be perfectly vahd to employ this pairwise adjacent sequence to represent CN0T(1,3). However, since each 
CNOT requires 19 exchange operations, this would entail implementing a total of 4 * 19 = 76 exchange gates. Here, 
recognition of the relation between exchange and SWAP operations allows for a more efficient solution. We note that 
any exchange gate when applied for a duration equivalent to a tt pulse, yields the SWAP operation, and that exchange 
gates may be inter converted by action of the appropriate SWAP operations, e.g., 



exp {itHis/h) = exp (iTTH23/h) exp (itHi2/h) exp {iTTH23/h). 



(4) 
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FIG. 1: Circuit diagram for implementing a CNOT gate between logical qubits 1 and 2. Logical qubit 1 is encoded in the 
top three physical qubits, 1-3, and logical qubit 2 in the bottom three physical qubits, 4-6. Each vertical line connecting two 
physical qubits represents an exchange gate with strength Jo switched on for time ts. As a result of the symmetry of the gate 
sequence, only 7 independent variables are required to define the gate times of the entire 19-gate sequence.'—' 
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FIG. 2: Equivalence of circuit employing only CNOT gates between adjacent logical qubits, to a circuit representing a CNOT 
between qubits 1 and 3. 

Combining an exchange gate between logical qubits i and z-|-l with exchange-generated SWAP operations immediately 
before and after, thereby makes the exchange operational between any two qubits i and j. This is summarized by the 
following: 



fc=j-i 



(5) 



k=i+l 



Making use of this relation, we find that a CNOT gate between qubits 1 and 3 can now be performed with only 55 
exchanges, an overall saving of 21 operations compared with the pairwise adjacent scenario depicted in Fig|21 
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The Makhlin invariants' ^^-^ guarantee a sequence of exchange gates and times that provide a two-qubit unitary 
which is locally equivalent to the CNOT. This is not necessarily equal to CNOT in the computational basis. 



exchange 
cnot 



To use our exchange-only CNOT in conjunction with other gates, we therefore need to find a set of local unitaries that 
transform this locally equivalent gate to the CNOT in the computational basis. Mathematically, we can represent the 
transformation as: 



CNOT = {Ui 



U2) ulZt"""'" {Vi ® V2) 



(6) 



Here Ui, U2, Vi, and V2 designate local basis transformations each consisting of at most 4 exchange gates'i-' that act 



on the first or the second logical qubit. For U, 
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we employ here the optimized sequence of 19 exchange gates 



from Rcf. j. It is then possible, using a procedure introduced by Makhlin'-'^^^ to find the local unitaries {Ui, U2, Vi, 



and V2) by i) recasting U[ 
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in the Bell basis, Mb = Q^U, 



exchange 
cnot 



Q, ii) evaluating the spectrum of to = M^Mb, 



and iii) then relating this matrix of eigenvalues to the corresponding matrix for the CNOT gate in the computational 
basis. Here Q is the matrix that transforms U'^not'^^^^ from the computational basis to the Bell basis, and is the 
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matrix transpose of Mb- Having obtained the local unitaries Ui and Vi, i = 1, 2, it remains to decompose these into 
an actual sequence of exchange gates in order to perform full exchange-only computation. We describe here two ways 
to accomplish this decomposition. The first is a general procedure based on numerical optimization. The second is 
an analytic procedure that is specific to the present case, since it relies on the ability to find analytic solutions to 
systems of trigonometric equations that may be less tractable for other situations. 

For both approaches we give the times tg for the individual exchange gates in units of 2?i/ Jq. We cite all time values 
as positive numbers here, and implicitly assume that the pulse-integrated exchange coupling value Jq is constant. (In 
Ref. 0, Burkard et al. consider the possibility of tuning the Heisenberg interaction around the point of zero coupling, 
thus allowing Jq to assume both positive and negative values.) Note that all gate times are defined modulo tt, i.e. an 
exchange gate iniplemented for tg is identical (up to a global phase of -1) to one implemented for tg + ir (see discussion 
above and Refs. |g and|3). 

For the numerical optimization procedure, we first express the local basis transformations, J7i, J72, Vi, and V2, as 
sequences of 4 exchange gates. We then use a version of the Nelder-Mead simplex algorithm '<2i^ to find those sequences 
that minimize both the matrix distance from the resulting CNOT gate Eq. jnj to the true CNOT in the computational 
basis, and the extent of leakage out of the encoded subspace. The Nelder-Mead algorithm is an example of a direct 
search method, i.e. it uses only function evaluations and does not rely upon any derivative information about the 
cost function. Each iteration begins with a geometric figure, a simplex, created from m + 1 coordinates in parameter 
space, where m is the number of variables of the cost function to be optimized. From this first simplex, new points 
are generated and the cost function is evaluated at these new coordinates. A new simplex, possessing better descent 
characteristics than the previous, is then generated from the cost function evaluations and the new test points.'^— 
The Nelder-Mead method is known to work well in low-dimensional instances such as those studied here. The local 
character of the method is nevertheless a concern. To avoid getting trapped in local minima, the parameter space 
must be densely sampled. We accomplish this here by shooting initial coordinates into parameter space, followed by 
optimization from the coordinate that gave the smallest initial value of the cost function. 

As noted above, our cost function contains two components. First, it includes an element by element matrix 
equivalence criteria, i.e. the matrix distance between the target gate (the true CNOT) and the candidate exchange- 
only representation of this, Eq. ||HJ), which is constructed from the 19-exchange sequence of Ref. |li together with two 
8-exchange sequences representing the local unitaries. Second, it contains a component that provides a penalty for 
leakage out of the encoded subspace, namely the sum of the absolute value of all matrix elements connecting states 
in the encoded logical subspace to states outside this subspace, illustrated in Fig.|21 
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FIG. 3: To ensure that our encoded unitary does not leak, we require that the encoded unitary, [U\ U2) Ul^'^"'"^" (Vi (8 V2), 
can be represented by the above matrix decomposition. The logical Hilbert space has dimension 2" x 2" for n logical qubits. 
The complete Hilbert space has dimension 2^ x 2^, corresponding to N physical qubits. 

These two requirements of minimal matrix distance from the target CNOT and non-leakage from the encoded 
subspace lead to the following expression for the cost function: 
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where n is the number of logical qubits and N the number of physical qubits. The first summation represents the 
matrix distance between Eq. © and the target CNOT, and the second summation, which goes over the lower left- 
hand block of the matrix in Fig. 13 represents the extent of leakage. The second summation thus consists of terms 
connecting basis states j < 2" inside the encoded logical subspace to all basis states 2" < z < 2^ outside the encoded 
logical subspace. 

Since we employ the 19-exchange sequence for f/cnot"'"^'^ from Ref. 1, the numerical optimization is restricted 
here to the two sets of 8-exchange sequences representing Ui (g) U2 and Vi V2, respectively. We implement this 
by constructing a sequence of 35 gates (i.e., 4+4+19+4+4, see Fig. 0)) with gates 9-27 taken from Ref. [ij, and then 
optimizing the cost function C, Eq. {Tj), for the matrix resulting from the entire 35-exchange sequence only over the 
16-dimensional parameter space of the two 8-exchange sequences. We note that since the available nearest neighbor 
exchanges E12 and E23 correspond simply to rotations of the logical states around the z-axis and about an axis 
oriented along ^f^j^Ox + 1/2(Tz, respectively'i^^ the exchange-based local gates will therefore not take states outside 
the logical subspace and will hence not add to the leakage term. The leakage parameter is therefore determined solely 
by the accuracy of the underlying gate sequence for t/cnof 

We have found that for such a 35-gate sequence, the overall cost function C can readily be reduced to less than 10"'' 
by this numerical optimization. More specifically, we find that the matrix distance (first term of Eq. 01 between this 
35 gate long sequence and the exact CNOT is 1 x 10~^ and that the leakage term (second term of Eq. [Tj) is 5 x 10"^ 
(equal to the value for C^c^of""^"^, as noted above). This level of optimization corresponds to a maximum element 
matrix distance from CNOT (ie., maximum matrix element inaccuracy) of 5 x 10~^. The optimal exchange-only 
sequences for the local transformations are summarized in Fig. 0| and in Table ^ respectively. 
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FIG. 4: Diagrammatic representation of one set of exchange gates that transform the 19-exchange gate CNOT {U^^^^"'"^'^) 
into the exact CNOT. These 8+8 gate times (ti-tg and t^s-ts^) were arrived at numerically using the Nelder-Mead simplex 
method'^^^ as described in the text. The corresponding 35 gate times given in TableQwill generate the exact CNOT to within 
a cost function value C = 10~^, corresponding to a maximum matrix element distance of 5 x 10"^. 

The second approach to finding an exchange-only representation of the local unitaries is analytic solution through 
matrix manipulations, as follows. We first analyze the similarity transformation S that diagonalizes U^^^f from Ref.yJ 
in the computational basis, i.e., S''U^!^^^^ angeg _ jj ^Yieve D is a diagonal 4-by-4 matrix. We found that S can be 
expressed as local operations on the logical qubits, namely S — I ® (^^1 — fo'y) • Mapping the SU{2) spin rotations 

to rotations in 50(3) and using the quaternion representation for 5*0(3) rotations (see Appendix ^ for details), we 
find that this similarity transformation can be realized using only 3 exchange gates. From the diagonal matrix _D, 
one can then readily generate a C-PHASE gate in the computational basis by merely performing rotations around 
the z-axis. Transformation of the resulting C-PHASE into the desired CNOT is subsequently realized by acting with 
Hadamard gates on the second logical qubit both before and after the resulting C-PHASE. These elementary gates 
are summarized in Fig. Both the Oz rotations and the Hadamard gate have analytic exchange-only solutions on 
the encoded subspace (see Sec. IH Bl and Fig. [Tjl. We thereby arrive at an alternative, fully analytic solution for an 
exchange-only realization of the local unitary transformations into the computational basis, requiring a total of 33 
exchange gates. These may be reduced to a total of 30 exchanges by combining the times of any sequential exchanges 
on the same pairs of qubits that occur as a result of juxtaposition of elementary gates. Consequently the desired 
overall transformation can now be completed with only 11 more exchanges than the underlying 19-exchange sequence 
for Knit- Fig. El and Table El summarize the resulting gate sequence and gate times, respectively, for the exact 
CNOT deriving from this analytic solution of the local transformations. 

The analytical sequence of Table IhI result in a maximum matrix element deviation of 5.5 x 10^^ from the true 
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(a) A single qubit gate (-^ ~ ^'^y) acting on the second logical qubit 
diagonalizes the 19-gate exchange sequence. The resulting diagonal 
4-by-4 matrix is then converted into the C-PHASE by cr^-rotations 
acting on both the first and the second qubit, with angles ip = 0.612497 
and 6 = —0.547580, respectively. These values are determined from the 
analytic solutions to a linear equation system with 3 unknowns: </>, 6 and 
a global phase. See Appendix^ for details as to how these parameters 
were obtained. 
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(b) The C-PHASE gate can be transformed into the CNOT gate by 
acting with Hadamard gates on the second qubit before and after the 
C-PHASE gate. 



FIG. 5: Representation of analytic sequence of local transformations that transform the 19-exchange sequence U^^^'^'^"^" from 
Ref. Q into the true CNOT in the computational basis. The exchange gates and times corresponding to the elementary local 
transformations are then obtained using the quaternion representation of the desired SU (2) unitaries (see Appendix ^ for 
details). 

CNOT. Thus, our analytic solution has similar accuracy as the numerical solution above. However, the 30-exchange 
analytical sequence in Fig. |^ and Table UTI represents a saving in both total number of gates and total time, relative 
to the 35-exchange sequence of Fig. H and Table D). The 30-exchang e sequence requires total time T^q — 43.373, 
compared with Tas ~ 54.326 for the 35-exchange sequence. This is an advantage for experimental implementation, 
since the shorter time allows for less decoherence. 



B. Single Qubit Gates through Exchange 



By considering the action of the Heisenberg Hamiltonian on the encoded subspace, Eq.^l it was found numerically in 
Ref. 1 that arbitrary single-qubit gates can be performed on the 3-qubit encoding using 4 nearest neighbour exchanges 
in serial operation mode, or by using 3 exchanges in parallel. 

The exchange gate times tg for a particular single-qubit gate can in principle be found by solving a system of 
equations with four unknowns. Specifically, given a single-qubit gate A, we consider the sequence of 4 exchange gates 



A = exp {-itiEi2/K) exp {-ihE23/K) exp {-it2Ei2/K) exp {-itiE2z/K) 
and solve the 4 coupled equations for the 4 times is, s = 1 — 4: 



(8) 



Aqo = exp {i{t2 + ti)) [ cosits) - ^sin{t3) j ( cos{ti) - ^sm(ti) 



■ exp (i{t4 — t2))sin(ti)sin{t3) 
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FIG. 6: Diagrammatic representation of a second, shorter set of 30 exchanges that transform the 19 exchange-only CNOT into 
the exact CNOT in the computational basis. Here C^cnot"^"'"^ corresponds to the 19 gate sequence of Ref. Q. The remaining 
gates and gate times, t\-tQ and t2&-t3o, correspond to the local unitaries {Ui, U2, Vi, and V2) which were arrived at analytically 
by decomposing the local unitaries into a sequence of elementary rotations in SU{2) as summarized in Fig. 1^1 and then using 
the quaternion representation of the corresponding rotations in 50(3) to find the sequences of exchange gates and the times, 
ts, that generate them (Appendix 0. The resulting gate times are listed in Table El 
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(9) 



Here we have used the properties of the exchange operators E12 = —<Tz and £'23 = \fi/2<7x + l/^^z in the logical 
basis'i^ . 

In this work we used a quaternion approach to represent the single qubit gates as rotations in 50(3), rather than 
solving the above coupled equations. Obtaining the exchange gate times in the quaternion representation also requires 
solving trigonometric equations in multiple unknowns, but these equations, especially for simpler gates, are often more 
straightforward than the matrix equation above, and analytic solutions easier to obtain. We first recognize that the 
combinations of the two exchanges E12 and i?23 generate the encoded single qubit operations and Ux , and thereby 
will sufhce to generate any arbitrary rotation on SU{2). A single qubit gate is then mapped from SU{2) to S0{3)^^^ 
where the desired rotation can be decomposed as a sequence of quaternions. The quaternion approach is convenient 
for finding an analytic solution for realization with a given number of exchanges, as described in Appendix IXI 

Using this approach we found exchange-only gate sequences for the n/8 gate (the T gate*^^), the NOT gate, and 
the Hadamard gate. A full description of these solutions is given in Appendix ^ We found that both the Hadamard 
and the NOT gate can be obtained from a sequence of three exchange gates, while the tt/S gate requires only one 
exchange gate. The corresponding gate sequences and gate times are shown in Fig. [7| (Note that any ct^ rotation by 
9 {Rz(9) = exp{—i6az)) can be realized as exp{iOEi2)). 

A third approach to finding the exchange-only implementation of the single qubit gates is through a Nelder-Mead 
simplex numerical optimization'iSiiSS) , as implemented for the local transformations in the previous section. Though 
not analytic, the Nelder-Mead approach is often much faster than analytic solutions and for single qubit gates the 
cost function can readily be reduced to zero at the machine precision level. 
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(a) The Hadamard gate can be implemented 
with 3 exchange gates. 



(b) The T gate (T=7r/8 rotation about z) can be 
implemented with just one exchange gate since 
Ei2 = — o"z, where refers to a z-rotation in 
the encoded logical subspace. 
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(c) The NOT gate can also be implemented with 
3 exchange gates 



FIG. 7: Exchange-only representation of encoded single qubit gates can often be arrived at from geometric considerations. The 
exchange-only representations of the Hadamard gate, the tt/S, and the NOT gate given here were obtained using a quaternion 
representation for the corresponding rotations in 50(3) f Appendix 



III. DEUTSCH-JOSZA ALGORITHM AND ALGORITHMIC FIDELITY 

There are currently several quantum algorithms that show speed-up over their classical analogs. 'iS^) Any one of these 
algorithms serve to investigate the merits of the exchange-coupled quantum dot implementation. We have chosen the 
Deutsch-Josza algorithm, a relatively simple algorithm requiring only a few logical gates (Fig. IS)). 

The objective of the Deutsch-Josza algorithm is to determine if an unknown function, / : {0,1}" i-^ {0,1}, is 
balanced (i.e. equal number of zeros and ones) or constant. For the three qubit Deutsch-Josza with its two query 
qubits and one answer qubit, this corresponds to 8 possible functions, six balanced functions and two constant 
functions. In Fig. El these functions are expressed in circuit diagram form. 

The existence of 8 different functions, and consequently of eight different versions of the algorithm, implies different 
fidelity values for a given initial state, pa{0) — p{0), for each version. For each version of the algorithm and for each 
choice of dephasing rate, we evaluate the fidelity according to 

F = Tr{po{tf)p{tf)}, (10) 

where Po{tf) is the density matrix describing system evolution in the absence of errors, and p{tf) the density matrix 
describing evolution in the presence of errors. Both poitf) and p{tf) are evaluated over the time tf required to 
complete that version of the algorithm. We define the algorithmic fidelity to be the worst-case fidelity, namely, for a 
given value of the dephasing rate, the fidelity of that version of the algorithm having the lowest fidelity. This provides 
a more conservative estimate than averaging over the 8 different fidelities resulting from each version of the algorithm. 
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FIG. 8: A circuit digram depiction of the Deutsch-Josza algorithm for three qubits. The two top most qubits are the query 
qubits and make up the domain for the unknown function / : {0, 1}" i— > {0, 1}'—', here indicated by f/ a unitary spanning all 
three qubits. For the three qubit Deutsch-Josza there are eight possible U's, all of which are given in Fig. |^ Evaluation of 
U is preceded and followed by Hadamard gates [H]. The result of the function evaluation is stored in the answer qubit, the 
bottom-most qubit in the circuit diagram, s^s y — \y (B f{x\,X2)), where xi and X2 designate the bit values of query qubit 1 
and 2 respectively. The outcome of the measurement of the query qubits, here designated by M, answers whether the function 
is constant or balanced. 
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FIG. 9: Circuit diagram representation of the eight different black box function evaluations (the U's of Fig. O inherent in the 
three qubit Deutsch-Josza algorithm. For 3 qubits the algorithm results in six balanced and two constant functions. 



IV. NUMERICAL METHODS 



As discussed in section m universal computation with the exchange interaction requires at least three physical 
qubits for every encoded logical qubit.'—) Thus, the Hilbert space grows as 8", where n is the number of logical qubits, 
and simulation of the density matrix becomes time consuming even for a few logical qubits. To permit consideration of 
larger numbers of logical qubits we use here the Monte Carlo wave function method.'^'*) This approach scales linearly 
with the size of the Hilbert space rather than quadratically as master equation methods. 



A. Monte Carlo Wave-functions 



The Monte Carlo wave function approach, also known as the method of Quantum Trajectories or the "Quantum 
Jump" approach, was originally developed within the quantum optics community. 'i^^tSSiS^) The method relies on 
a twofold approach to the system evolution. First, an effective Hamiltonian gives rise to a continuous non-unitary 
evolution of the system wave function. Second, decay operators, identical to the Lindblad operators within the master 
equation formalism'iSl) , give rise to stochastic discontinuities in the wave function. These stochastic discontinuities 
resemble the jumps one might expect from a single isolated quantum system. 

Both the effective Hamiltonian and the decay terms can be obtained from the Lindblad master equation. Ui) The 
conditional, or effective, Hamiltonian is given by 

Hcond — Hgyg — 2_ \ Cm^™-i (H) 



10 



where Hsys is the system Hamiltonian and the Cm's are decay terms resulting from the system-environment interaction 
with a subsequent tracing over the bath degrees of freedom. The total time evolution under Hcond is discretized and 
at each time step the probability of any collapse event is calculated: 

Ptot = J2 ^P"^ = E (*(i)ict„c™|vi/(t)). (12) 

This total collapse probability accounts for the occurrence of any error event, Cj^Cm, that collapses the system 
wave- function. The calculated Ptot is compared against a random number, r, taken from a uniform distribution. This 
is the first Monte Carlo test. A random number less than the total collapse probability, r < Ptot, designates that 
an error has occurred. Another Monte Carlo test, involving another random number, decides which error occurs. 
This second random number, s, is compared against the normalized collapse probabilities, SPm/Ptot, and that error 
is chosen that first makes the sum of the normalized error probabilities greater than the random number. Thus, upon 
completion of the two Monte Carlo tests the new wave-function is: 

mt+At)) = — -, (13) 

where Ci is the error operator randomly chosen in the second Monte Carlo test such that X]L=i ^Pm/ Ptot > s. 

On the other hand, if r is greater than Ptot, the system state is propagated according to Hcond and we obtain the 
system state at t -\- At: 

\^{t + M))' = eiqy{-iMH,ond/ri)\^{t)). (14) 

Since Hcond is non-Hermitian the norm decreases over time. To ensure equivalence with other approaches to simulating 
open quantum systems, e.g. master equations'^^ , the wave-function must be renormalized at the end of every time 
step: 

Upon renormalization a new total collapse probability is calculated and the entire algorithm begins anew for the next 
timestep. 

The time step. At, must be chosen such that Ptot *C 1, since for too large time steps a perturbative expansion for 
calculating the error probabilities is no longer justified. Each trajectory corresponds to a possible evolution of a single 
quantum system. The fidelity measure is based on the density matrix which can be regained by averaging over many 
trajectories. '<i25) Use of Ea.f llO|l leads to the following expression for the fidelity: 

^=T^ E \ iMtfmnitf)) \' . (16) 

Here l^^nitf)) is the wavefunction for trajectory n propagated with decoherence, and |^'o(i/)) is the wavefunction 
propagated in the absence of decoherence. Simulations are run with increasing numbers of trajectories until the fidelity 
converges. To ensure that the fidelity we obtain contains no artifacts or anomalies due to the choice of the initial 
system state, we sample a random distribution of initial states, all located on the surface of the hyperdimensional 
Bloch sphere of logical basis states. These Bloch states are given by: 



N-l 

iN-k 



Cfe(0)= exp{iipk)Y\f^Q'' cosOi Y]_ sin6'j 

j=N-k 

'Po = 0, eo= n. (17) 



B. Split Operator Method 



The dimensionality of the CNOT gate simulation on 6 physical qubits, a 2^ = 64 dimensional Hilbert space, is 
still small enough to permit the use of an exact diagonalization method to construct the conditional time evolution 
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operator from Hcond- However, the three qubit Deutsch-Josza algorithm requires nine physical qubits within the 
exchange-only model, and Hcond must then be exponentiated on a 2^ = 512 dimensional space. We have developed 
a more efficient method to construct Ucond = s.xp{—iHcondt/K), that proves computationally efficient for even larger 
Hilbcrt spaces. We make use of a split-operator decomposition of Ucond that is based upon the fact that Hcond, 
including decay elements, can be split up into a term diagonal in the spin components and a term off-diagonal in the 
spin components. The diagonal part can be expressed as 



N 

D = j 5,,, • s,., - 2ihrdep J2 ^1^^-.*' (18) 



and the off-diagonal part as 



J N 

T = - {s+sj + s+sj ) - 2m:cn.^ J2 s^^- . (i9) 



Here F^ep and Tcmi are the single spin pure dephasing and emission rates obtained from theoretical and experimental 
estimates and measurement32ii24i2iS&. 

The time evolution operator, Ucond, can then be expanded in an approximation accurate up to second order in At 
(errors 0{At^)) as: 

Ucond ~ exp {~iDAt/2h) exp (-iTAt/h) exp {-iDAt/2h). (20) 

The simulation of Ucond{t) now reduces to consecutive application of the exponentiated operators D and T. This can 
be efficiently done if states are represented by integers in binary notation, i.e. each spin is represented in the Sz basis 
by a or a 1 at location j in the binary representation of the state vector x — |fci...A:jv). The spin operators can be 
recast as binary shift and logic operations 

i(l - 2 * ibits{k,j, 1)) \kik2...k,...kN), (21) 

(1 - ibits{k,j, l))\kik2...ibits{k,j, 1) + l...fcjv), (22) 
ibits{k, j, l)\kik2...ibits{k, j, 1) — l.../cjv), (23) 

where ibits denotes a compiler (F90) command that extracts the value of jth bit in integer k. These binary operations 
are seen to act upon the system state vector in a manner analogous to raising and lowering operators. With this 
approach it becomes possible to simulate very large Hilbert spaces. This approach was employed previously in a 
checkerboard time propagation scheme for study of many body dynamics of interacting particles on lattices. 



Sz\kik2- 


. .kj . . .k]\[ 


S+\kik2. 


. .kj . . .k]\[ 


S^\kik2. 


. .kj . . .k]\[ 



C. Parameters 



Data with regards to decoherence parameters for exchange coupled quantum dots is scarce. We have used experi- 
mental parameters to the extent possible. Where none are available we have interpolated, using theoretical estimates, 
between what is experimentally known and the requirements of our simulations. In general, experiments in condensed 
matter physics have indicated that the electron spin states, because of their weaker coupling to the environment, 
exhibit longer coherence times than the charge states. Due to difficulties involved in measuring single spin states, 
however, the majority of these experiments provide us with a ensemble measurement of the lifetime, and are thus 
not directly applicable to a system of single spins'i^iiiS^ We employ here the inequality relationship Ti > T2 > Tj*, 
where Ti describes the time scale for the spins' exchange of energy with the surrounding matrix, T2 is the single spin 
decoherence time, and T2 is an ensemble decoherence time which, in addition to contributions from Ti and T2, also 
contains effects due to inhomogeneities in the system, to the surrounding matrix, and to the control fields. 'li^-' Taking 
into account the single dot Ti times obtained by Fujisawawa et al.^^ , we arrive at a set of reasonable decoherence 
parameters: a dephasing rate on the order of ns and a timescale of /xs for emission and absorption, both of which 
involve spin flips. Consequently, in a system like ours, where the strength of the exchange coupling is assumed to be 
on the order of 0.2 meV, we find dimensionless decoherence rates KT/Jq — 10^^ — 10"^. Additionally, we find that 
dephasing errors dominate over emission events, according to = — 10^- Consequently, pure dephasing is 
a greater concern than spin flip errors, and will thus constitute the main focus of our simulations. In the context of 
decoherence, it should be noted that a reduction of some of the decoherence pathways may be possible with the use 
of experimental techniques such as spin polarization and spin echo, that have been developed for other systems such 
as NMR.^^-'^' Our simulation does not include these potentially very beneficial modifications. Note that the encoding 
in Eq. Q is automatically protected against collective dephasing, but not against independent single spin dephasing. 
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RESULTS 



A. Exchange-only CNOT, in serial mode 



The encoded exchange-only CNOT is the first unitary operation we investigate. Fig. EH shows the fidehty over the 
19 gate implementation. We see that for a dimensionless dephasing rate of fiT / Jq — 10^'^ the probability of perfectly 
performing a CNOT is - 98%. 
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FIG. 10: Fidelity simulation for CNOT gate, subject to dephasing errors, for a linear quantum dot array, where three physical 
qubits encodes each logical qubit. o 19 exchange gate implementation of CNOT gate, o Free system evolution for duration 
equivalent to CNOT. 



Burkard et al. have indicated a possibiUty that actual implementation of the gate, i.e. turning on the Jq coupling 
between adjacent quantum dots, might result in faster decoherence.'^) We therefore used the free system evolution 
under identical conditions of dephasing as a point of reference. We find that gate implementation does result in faster 
decoherence, but that the effects only become appreciable at higher dephasing rates, HTdep/Jo — 10^^ fFig. lTUI) . We 
also compared the fidelity obtained for the encoded CNOT gate to a standard CNOT gate between two physical 
qubits. To reduce the effects of method and parameter choice, we used the same values for the dephasing rate to 
interdot coupling strength ratio, TiTdep/Jo, and took the timescale for the CNOT gate to be the same as for the 
exchange coupled qubits. This comparison is shown in Fig. 1111 

Fig. II II shows that the performance of the encoded CNOT gate deteriorates faster with increasing dephasing rates 
than does the bare CNOT gate. We recall that the timescale for coherence loss due to dephasing decreases with the 
number of qubits. As shown in Ref. '2? the master equation for a single qubit under pure dephasing can be written 
as: 



m 



lUJ 



'-{(JzP(t)cr,_ - p{t)), 



(24) 



Tr 



2 ' " 2 

where w is the shifted frequency of the two level system and F^ep is the dephasing rate. The dynamics of Eq. H24l can 
be solved analytically and for this 2-level system the fidelity (see Eq. ((11])) ') is obtained as: 

Poo(O) Poi(0)e-™* \ ( poo(O) poi(0)e-™*-i^--* 
Pio(0)e™* pii(O) ) \ pio(0)e™*-r-p* pii(O) 

= pIM + P\M + e-^-''*(|poi(0)P + |pio(0)n (25) 

This state-dependent fidelity must now be integrated over all possible initial states to obtain the algorithmic fidelity. 
Employing a general state ^'(0) = cos ||0) -I- e*"^ sin ||1) and integrating over all possible states on the surface of the 
Bloch sphere leads to the average fidelity: 

rr{po(i)p(0}sin(6')d6'd(/) (26) 
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FIG. 11: Fidelity simulation for a CNOT gate, subject to dephasing errors, for an encoded linear quantum dot array and 
between two physical qubits. o 19 exchange gate implementation of CNOT gate. A CNOT gate implemented between two 
physical qubits, with gate time equal to 19 exchange gates. 



2 + e'^-^^p* 
" 3 ' 

This average fidelity asymptotically approaches 2/3 for a single qubit and (|)^ for N independent qubits as t oo. 
Now the CNOT gate involves couplings between qubits, and hence the long-time CNOT gate fidelity dependence 
upon the number of qubits will not be exactly (|)^. However the overall faster decay of the fidelity as TV increases 
is still found. This is also evident in Fig. ^] 
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FIG. 12: Fidelity simulation for CNOT gate, subject to emission errors, for a linear quantum dot array, where three physical 
qubits encodes each logical qubit. o Nineteen exchange gate implementation of CNOT gate, o Free system evolution for 
duration equivalent to CNOT. 

The effects of emission upon the CNOT gate fidelity are summarized in Fig.^| We see here a greater degeneration 
as a function of the emission error rate. Emission events are intrinsically more detrimental to the proper operation 
of our quantum device than are dephasing errors. They signify a change in the system's overall energy. In contrast. 
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dephasing errors merely introduce a random phase difference between the ground and excited states. Note, under the 
independent error model used here the logical basis states do not lie in a decoherence free subspace, which would have 
been the case for a collective error model. *^ Thus, both emission errors and dephasing errors will take the system 
outside the encoded subspace (S**"* — 1/2). We recall that emission events are generally a much rarer occurrence 
than dephasing events. As mentioned in section IIV CI the expected ratio of dephasing to emission in semiconductor 
quantum dots is = ~ 10^ - 10^. In contrast, for HTdep/Jo ^ 10"^ we have hTemz/Jo ~ 10"^ - 10"*^. As 
seen in Fig. ^] (inset), in this regime the encoded gate fidelity is > 95%. 



B. Algorithmic Fidelity of Three qubit Deutsch-Josza 

For the the three qubit Deutsch-Josza algorithm there are eight possible function evaluations, listed in Fig. |51 
(Sec. Illljl . We found an algorithmic fidelity F > 0.70 for dephasing rates hTdep/Jo < 10^'^ (increasing to F > 0.98 
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FIG. 13: Simulation of three qubit Deutsch-Josza algorithm, where the fidelity plotted is the worst case fidelity, i.e the lowest 
calculated fidelity for any of the eight possible functions. 



for hTdep/Jo < 10 ^). The algorithmic fidelity is shown as a function of HTdep/Jo in Fig. 1131 



C. Using N-qubit unitaries to simplify algorithm implementation 

From the perspective of quantum computing, another approach to overall time reduction is possible. As noted in 
Refs.lsslandls^ any sequence of logical gates may be replaced by a single N-qubit unitary. Thus, in Sec.^it was shown 
that the sequence of four adjacent qubit CNOTs is equivalent to CN0T(l,3).We may similarly reduce combinations 
of other single and two qubit-gates to just one N-qubit gate involving only 2 body interactions. Certainly, many of 
these N-qubit gates will not be as simple as the CN0T(1,3). They may nevertheless allow for a faster, more efficient 
experimental realization of certain combinations of gates. 

We have analyzed this approach for the example of a CNOT sandwiched between 4 Hadamard gates (Fig. I14|l . 
This circuit, which can readily be verified to be equivalent to a CNOT with the control and target qubits reversed, 
constitutes a relatively simple two-qubit unitary where it is possible to find an analytic solution for exchange-only 
implementation, using the quaternion decomposition described in Appendix IXI Starting with the original unreduced 
version (3 -I-1-I-1-I-3-I- 19 -1-3-1-3 = 33 exchange gates) of the exact analytic CNOT sequence shown in Fig. El 
the exchange gates corresponding to the Hadamard gates are then added before and after this sequence to arrive at 



a 45 exchange gate sequence for the desired 2-qubit unitary, shown in Fig. 15(a) I. The length of this sequence can 
be reduced by using the relation = I for the Hadamard gate and by combining the gate times for consecutive 
exchange gates acting on the same qubit pair, as was done for Fig.|^ This yields the 31 exchange gate long sequence 



shown in Fig. 15(b) The corresponding exchange gate times are listed in Table iHIl In more general cases of A^-qubit 



15 



1 — 


H 




H 






2 — 


H 




b- 


H 









FIG. 14: The 2-qubit unitary consisting of a CNOT sandwiched between 4 Hadamards. 

unitaries involving many consecutive two-qubit and one-qubit gates, this analytic approach might not be feasible and 
numerical optimization techniques such as that described in Section [ll Al will then have to be used. 

The sequence of gates shown in Fig. ^| contains 31 gates. This analytic solution should be compared against 
the 42 gates required for implementing the gates consecutively using 4x3 exchange gates to represent the four 
Hadamard gates and 30 exchange gates to represent the CNOT. One expects that with fewer gates and shorter total 
implementation time, better fidelities would result. Fig. 1161 shows that this is indeed the case. The fidelity for the 
shorter sequence is about 5 to 10 percent better at decoherence rates {fiTdep/Jo) ^ 10^^, and the improvement is 
even greater for faster decoherence rates. 

VI. SUMMARY AND CONCLUSIONS 

We have investigated here the merits of exchange-only quantum computation based on a linear quantum dot array. 
For this architecture, we have shown that it is possible to achieve fidelities of 95 % or greater for the CNOT gate 
with realistic choices of system parameters. We have also elucidated the performance under dephasing of the 3 qubit 
Deutsch- Josza algorithm that is implemented with a 3 qubit encoding. For this algorithm we obtained a fidelity of at 
least 0.70 for realistic dephasing rates, KTdep/Jo < 10~^. In addition, we have provided an example and supporting 
simulation data for replacing a series of gates with a single unitary. This approach is advantageous because it reduces 
the total time for implementing the algorithm, and will thus thereby also reduce the effects of decoherence. 

Our results indicate that, due to the currently rather high decoherence rates, achieving the 10""* threshold required 
for fault tolerance is beyond present capabilities in this spin-coupled quantum dot model. Nevertheless, the success 
probabilities for the Deutsch-Josza simulations imply that exchange coupled quantum dot arrays make for an inter- 
esting testbed. With improved experimental solid state technology, greater gate fidelities can be expected. Getting to 
KTdepI Jo = 10~^ will yield > 98% algorithmic fidelity fFig. I13() . In the context of extending the relevance of these 
simulations, and how they pertain to other systems, it should be noted that a perfectly isotropic interaction is not 
a necessity for universality, as it has recently been shown that both the anisotropic and asymmetric interactions are 
universal under appropriate encoding. Ui) 

We have attempted to provide here a realistic estimate of gate and algorithmic fidelity for exchange-only quantum 
computation. Our estimates could be improved by having more realistic single spin parameters and by incorporating 
pulse shaping techniques. The square pulses assumed here provide only an approximation to experimental pulses. 
However, since the ability to implement SU{4) and SU{2) operations is only dependent on integrated pulse shape, 
square pulses are adequate from a theoretical perspective, provided that the qubit is defined on a pure two-level 
system and a square pulse therefore cannot cause excitation to higher levels. In the future it would be desirable to 
perform simulations where the pulses better reflect what is achievable in the laboratory. Allowing for pulse shaping 
and employing chirped pulses have been shown to improve both gate and algorithmic fidelities*i^\ making such 
simulations doubly interesting for future work. 
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(a) Adding the exchange-only representation of the Hadamard gates onto 

the exact CNOT sequence of Fig. [^results in a 45 gate exchange 
sequence that implements the 2-qubit unitary of Fig. 1141 This sequence 

can be reduced by adding the gate times for consecutive exchanges 
acting on the same qubit pair and by making use of the relation = I 
for the Hadamard gate to arrive at sequence b) below. 
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(b) Resulting shorter sequence of 31 exchange gates that directly 
implements the 2-qubit unitary of Fig. 1141 The corresponding gate times 
are given in Table ITTTl 
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FIG. 16: Comparison of fidelity obtained using a standard serial implementation of the logical gates with 42 exchange operations, 
versus the 31 gate sequence shown in Fig. 1151 The simulations employed 20 time steps per exchange gate and 25600 trajectories 
averaged over at least 64 input states sampled from the surface of the hyperdimensional logical Bloch sphere, Eq. 1171 Open 
circles (o): the normal serial implementation of the logical gates in Fig. 1141 (42 exchanges). Open triangles (A): the overall 
unitary gate is decomposed into individual exchange gates according to Fig. 1151 f31 exchanges). 
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FIG. 17: The angles /3 and 7 are used to define the axis of rotation when using hyperspherical coordinates. 
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APPENDIX A: QUATERNIONS 

First developed by Hamilton, quaternions provide an alternative to the normal matrix representation of vectors 
and rotations in SO[?,).^^^^ 

A general quaternion, q = {w, ga;, gy, ^z} = {w,?/}, is just a four component array which can represent cither a 
vector in 3?"^ or an S'0(3) rotation. The ffist component represents a scalar and the last three components a vector. 
More specifically, vectors and rotations take the form: 

V ^ {0, Uj;, Vy, V^} 

q-r — {cos(a/2), ra;Sm(a/2), rySin{a/2), rzSin{9/2)} (Al) 

Here qr corresponds to a rotation of a around the Cartesian vector f — {r^, ry, r^). Using hyperspherical coordinates 
this vector, and the resulting quaternion, can be written as 

f = (sm(/3/2)cos(7/2), sin{P/2)sin{j/2), cos(/3/2)) 
qr ~ {cos{a/2), sin{a/2)sin{(5 /2)cos{"f /2), 

sin{a/2)sin{f3/2)sin{j /2), sin{a/2)cos{P/2)}, (A2) 

where /? and 7 define the axis of rotation, as shown in figure [T7I and a is the angle of rotation around this axis. Two 
quaternions are multiplied together to form a new quaternion: 

qi*q2 = {-iwi,"!} * {^2,^2} 

= {wiW2 — ILl ■ 1*21 WliJL2 + 'W'2Ul + ILl 'X 1L2}- (A3) 

Hence one can derive expressions for sequences of rotations, Q = qN * Qn-i * ■•• * (72 * Qi- For example the sequence 
of Euler angle rotations'^'^*-' R{(f>,9,x) — e*^~*'^^'^^e'^~*'^«^^e^~*'^^-^^ becomes in the quaternion representation: 

sin{^-)cosi^ _ |), co.(^).zn(| + |)}. (A4) 
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The Heisenberg Hamiltonian provides us with two rotations on the encoded qubit, Oz and a^'!^ 

The first is a rotation around z, the second a rotation around the axis k = Vi/2x + l/2z. Note, as seen from Eq. 12 
using Eij rather than Hij to calculate gate sequences and gate times just results in a global phase in the final state, 
which can be accounted for as follows: 

M M M 

Jl exp {it',,E,,j,) = exp {iJo/AhJ^tk) H '^xp {itkH,,jJh). (A6) 

k=l k=l k=l 

Where t'/. = results from the rescaling necessary when using Eij instead of Hij. 

Given the mapping of SU{2) to 50(3) (the Bloch sphere representation for spin-1/2 systems, see Fig.HSjCS^, these 
exchange gates (Eq. IA5II can be cast in the quaternion representation respectively as 

qi{t) = {cos{t/2), 0, 0, ~sin{t/2)}, 

q2{t) = {cos(i/2), ^sin{tl2), 0, \sin{t/2)}. (A7) 



z 




FIG. 18; The Blocli sphere representation of a spin 1/2 particle.'—^'' 



Having obtained the quaternions that correspond to the different possible exchange gates on our encoding, and 
understanding how these quaternions can be multiplied together, we can now investigate the number of exchanges 
required to generate certain single qubit gates. 

Using the geometric representation for generation of the SU{2) group of rotations'i^242i) we investigate how many 
sequential implementations of our exchange gates, exp(rfi5i2) and exp(iii?23), suffice to generate all possible single 
qubit rotations. Projected onto the x — z plane of the Bloch sphere, the rotations around z and ^/i/2x + l/2z 
corresponding to these exchange gates can be represented as in Fig. ^1 

Based on this geometric interpretation for the action of the exchange gates, we can proceed as in Ref. and 
determine the minimum number of exchange gates required to generate any SU (2) rotation from a given pair of 
exchanges. To do this we consider how many exchange gates, and in what order they should be arranged, suffice to 
rotate a state from the south pole all the way to the north pole. As shown in Fig. ^1 this extreme rotation can be 
achieved using 3 exchange gates in the sequence exp {it^E2'i) exp {it2Ei2) exp (iti£'23). This rotation from one pole to 
the other is the hardest to achieve, in the sense that it requires the most changes of direction and hence the greatest 
number of exchanges. All other rotations will require equal or less exchanges. 'i^^ To this sequence of 3 exchange gates 
we now add a fourth exchange gate, namely exp {it4^Ei2) in order to allow for an arbitrary phase to be obtained when 
the state is located at the north pole of the Bloch sphere. This extra gate corresponds to the first CTz rotation when 
a similar decomposition is considered for the Euler angle construction for rotations in SU{2) fEa. IA4p . We now use 
the quaternion approach to find explicit exchange sequences for several elementary gates. 
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FIG. 19: Planar projection of the Bloch sphere onto the x — z plane, depicting the rotations corresponding to exp {itEi2) and 
exp(itE23). Implementing E12 is the same as rotating around the z axis, the action of which causes states to follow a line 
parallel to the x axis. The E23 gate corresponds to rotations around the axis\/3/2a; + 1/2 z, and states then follow a line 
perpendicular to this axis during the duration of the gate. Combined in the sequence exp (it3iJ23) exp (if2-Bi2) exp (jti_E23) 
these two exchange gates allows us to reach any point on the Bloch sphere starting from any arbitrary point. The figure shows 
how this sequence of gates allows us to rotate a state from the south pole to the north pole, i.e. to the most distant state. The 
ability to generate all other rotations follows directly from this.'—' Note, once at the north pole it might be necessary to adjust 
the phase of the state. Thus we need to add one more i?i2 gate to the sequence, arriving at a total of at most 4 exchange gates 
to generate any SU{2) operation. (Note that this final -E12 gate serves exactly the same purpose as the final rotation does 
in the Euler construction.) 



i) Similarity transformation for exact CNOT, S. In Section III A|l it was shown that U^not"^"^'^ '^^^ diagonalized as 
D = U^^^i"'"'^'^ S . Here S* = / ® — ^ay which corresponds to a rotation of 60° about the y axis on the second 

qubit. The action of S on the second qubit can be written as cos{^^)I — i sin{^^)ay. Referring to Eq. IA2I it is 
evident that in the quaternion representation of rotations in 5*0(3) this corresponds to 

g = {^, 0,1,0} (A8) 

For sequential application of two exchange gates, exp {it2Ei2) cxp {itiE23) or exp {it2E2z) exp {itiEi2), consideration 
of the trigonometric equations that define the different components of the resulting quaternions shows that with 
just two exchange gates it is impossible to satisfy the requirement that both the x component and z component 
of the resulting quaternion be simultaneously zero (Eq. IA8|1 . However, with a sequence of three exchange gates 
(exp {it'iEi2) exp (?i2£^23) exp {itiEi2)), which in the quaternion representation corresponds to 

qs{h,t2,h) = {cos{t2)cos{ti + tj,) +^sin{t2)sin{ti+t3), 
^sin{t2)cos{ti - ts), 
^sin{t2)sin{ti ~ ig), 

—cos{t2)sin{ti + t-^) + —sin{t2)cos{ti + t:^)}, (-A-9) 

we find that a solution for S can be obtained. This is accomplished as follows. We need the x component of qs to be 
zero and the y component non-zero (Eg. IA8I) . Thus we set <i — is ~ Tr/2 and solve for t2 from 

^s^nit2) - i (AlO) 
We then use the first (scalar) component of qs to solve for ti + from 

2 1 \/3 

-cos{ti+t3)+ ^smiti+ts)^—, (All) 
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which can be rewritten as 

9 . „. ( f \ ^ 

(A12) 




where e = sin{t\ + is). Solutions of Eas. lAlOl and IXl2l yields t\ — asm(l/3)/2 + 7r/4, t2 = asm(l/\/3), and 
H = asm(l/3)/2 - 7r/4. 

After obtaining the three-exchange representation of S*, we still have to find the single qubit gates that transforms 
the resulting matrix D into the C-PHASE gate. Since C-PHASE, like Z?, is diagonal in the computational basis, a 
combination of Uz rotations and a global phase is sufficient to transform one into the other. To find the relevant 



rotation angles, and 9 (Fig. 5(a) I, and the global phase factor, fi, we set up the following system of equations 

n + e + (i) + Am = 

r2 + 6'-0 + Aoi = 

r^-^ + ^ + Aio = 

r2 - 6* - + All = TT. (A13) 

Here the A^'s are the arguments of the diagonal matrix elements of D when these are written as phases, 
i.e. exp(iAy) = Dij, and the other terms on the left are obtained from the diagonal elements of the matrix 
exp (if2)ea;p(i(0 (Jz (X) / + (/)/ (8) iTz)). Solving for these variables, one finds that — 0.612497 and 9 — —0.547580. 
To recast these in terms of exchange gates it is enough to realize that implementing E12 is equivalent to a Uz rotation 
(see Eg. IA5|) . Thus, the exchange gate times corresponding to 9 and are tg = 9 and t^f, — (p, respectively. For the 
tt/8 gate the same argument trivially yields a single exchange gate time of i^r/s — tt/S. 

ii) Hadamard gate. Using the same sequence of three exchange gates, exp (it3£'i2) exp (1^2-^23) exp (iiii?i2), and 
realizing that for the Hadamard gate it is the y component of the resulting quaternion that must be zero, we can find 
the solution for the Hadamard gate in an identical fashion. This results in a quaternion representation: 

QH [ti, t2, h) = QH (^-atan{V2)/2, asin{^/2j3), -atan{V2)/2j . (A14) 
in) NOT gate. The NOT gate has a quaternion representation 

QN - {0, 1, 0, 0} (A15) 

which corresponds to a full rotation from the south pole to north, or vice versa, when interpreted on the Bloch 
sphere. For this gate the three exchange gate sequence considered above is insufficient to generate the gate, since 
this sequence can never result in an x component greater than V3/2 (see Eg. IA9II . whereas the NOT gate requires 
an X component of 1. We can find a solution for the NOT gate using the modified three exchange gate sequence 
exp (it3i?23) exp (it2i?i2) exp (iii-B23)- This sequence leads to the following expression for the corresponding quater- 
nion: 

qN{ti,t2,t3) = {cos{t2)cos{ti + ts) +^sin{t2)sin{ti+t3), 

^^cos{t2)sin{ti + t^) + sin{ti) sin{t2) sin{t3) , 
-^sin{t2)sin{ti - ts), 

-^sin{t2)cos{ti ~ ts) + ^cos{t2)sin{ti + t^) 

-~-cos{ti)sin{t2)cos{t3)}, (A16) 

We can then solve for the times t^, s = 1 ~ 3, by first recognizing that since the y component must equal and the 
X component must equal 1 then, ti — t^ = 0. Making use of this and the expressions for the x and z components, 
we can then solve for ^2 from ^2 = asin{l/^/3). Substituting this back into the expression for the scalar component, 
which also must be equal to zero, we then obtain the corresponding values for ti and ^3 as ti = = atan{^/2) . 
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TABLE L Exchange gate times for the fuU sequence for CNOT in the computational basis, Eq. |S| determined by numerical 
optimization. Exchanges 1-8 correspond to Ui0 U2, exchanges 9-27 to f/^^^f and exchanges 28-35 to Vi ® V2 (see Fig.^J- 
Starting from the gate times for exchanges 9-27 given in Ref. [l|, a Nelder-Mead optimization routine was used to find the 
additional gate times corresponding to gates 1-8 and 28-35. The optimization criteria was minimization of the the cost function 
of Eq. El over the 16-dimensional parameter space of exchanges 1-8 and 28-35. The sequence of 35 gates and associated gate 
times seen here represent the first successful run where the cost function dropped below the required tolerance of 10^''. All 
gate times are given as positive numbers modulo tt, in units of 2h/jQ. 
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TABLE II: Exchange gates and times required to transform the exchange CNOT gate into the exact CNOT gate in the 
computational basis, Eq. (|HJ, obtained with analytic solution of the local transformations Ui,Vi,i = 1,2. Exchange gates 1-6 
correspond to Ui ® U2, exchange gates 7-25 to U^^^^"'"^'^ , and exchange gates 26-30 to Vi ® V2 (see Fig.jSJ. Exchanges 7-25 are 
taken from Ref. Q. The remaining gates (1-6 and 26-30) were arrived at analytically by decomposing each local unitary into a 
sequence of simpler rotations as described in the text, and then using the quaternion representation to find the corresponding 
rotations in 5*0(3). See Appendix^for full details. All gate times are given as positive numbers modulo tt, in units of 2%/ Jq. 



Exchange 


Qubit 


Qubit 




Exchange 


Qubit 


Qubit 




Number 


1 


2 


Time 


Number 


1 


2 


Time 


1 


4 


5 


2.663935 


16 


1 


2 


2.639495 


2 


5 


6 


0.955317 


17 


2 


3 


1.302882 


3 


1 


2 


0.612498 


18 


3 


4 


0.463868 


4 


4 


5 


1.161038 


19 


2 


3 


2.554511 


5 


5 


6 


2.526113 


20 


4 


5 


0.871873 


6 


4 


5 


0.615480 


21 


1 


2 


1.249644 


7 


3 


4 


1.290877 


22 


5 


6 


2.107472 


8 


2 


3 


0.650655 


23 


2 


3 


2.554511 


9 


4 


5 


0.871873 


24 


4 


5 


0.871873 


10 


1 


2 


1.934484 


25 


3 


4 


1.290877 


11 


5 


6 


2.107472 


26 


4 


5 


2.526113 


12 


2 


3 


0.650656 


27 


5 


6 


0.615480 


13 


4 


5 


0.871873 


28 


4 


5 


0.477659 


14 


3 


4 


2.012206 


29 


5 


6 


0.955317 


15 


2 


3 


1.302882 


30 


4 


5 


2.663935 



24 



TABLE III: One possible set of exchange gates and times that implements the two-qubit unitary of a CNOT sandwiched 
between four Hadamards fFig. 1141 equivalent to a CNOT with target and control qubits reversed). Starting with the 19-gate 
exchange-only CNOT sequence for U'^Zt''"^'' ol Ref. '7, the additional local gates were then determined analytically using the 
quaternion formulation summarized in Appendix^ All gate times are given as positive numbers modulo vr, in units of 2h,/J(j. 
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